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A new thresholding method, based on ^-statistics and called or- 
der thresholding, is proposed as a technique for improving the power 
when testing against high-dimensional alternatives. The new method 
allows great flexibility in the choice of the threshold parameter. This 
results in improved power over the soft and hard thresholding meth- 
ods. Moreover, order thresholding is not restricted to the normal 
distribution. An extension of the basic order threshold statistic to 
high-dimensional ANOVA is presented. The performance of the basic 
order threshold statistic and its extension is evaluated with extensive 
simulations. 

1. Introduction. It is well known that, when testing against a high- 
dimensional alternative, omnibus tests designed to detect any departure 
from the null hypothesis have low power. Neyman's (1937) truncation idea, 
though motivated by a different type of problem, served as the spring board 
for the development of modern related approaches. Soft and hard thresh- 
olding were introduced in the context of nonparametric function estimation 
using wavelets by Donoho and Johnstone (1994). Johnstone and Silverman 
(2004) elaborate on a number of additional applications of thresholding in- 
cluding image processing, model selection, and data mining. Beran (2004) 
considered applications to the one-way ANOVA design. Spokoiny (1996), 
Fan (1996) and Fan and Lin (1998) consider applications of thresholding 
methods to testing problems. Fan (1996) found that hard thresholding out- 
performs both soft thresholding and adaptive Neyman's truncation. 

This paper proposes a new thresholding method based on L-statistics, 
which is termed order thresholding. Order thresholding allows great flexibil- 
ity in the choice of the threshold parameter, can be used for distributions 
other than the normal, and extends naturally to factorial design settings. 
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In the simple context where the Xi are independent N(9i, 1), i = 1, . . . ,n, 
and we wish to test Hq : 9\ = • ■ ■ = 9 n = vs. H a :9i^0, for some i, the hard 
thresholding and order thresholding test statistics are, respectively, 

n n 

(1.1) T H (S n )=J2 Y i I {Yi>S n } and T L (k n ) = Y^ 

i=l i=l 

where Yi = Xf, Y^ n < ■ ■ ■ < Y nyTl are the ordered Y^s, c in = I(i > n k n ) , 
and 5 n , k n are the corresponding threshold parameters. Thus, Ti,{k n ) is an 
L-statistic based on the largest k n squared observations. Conceptually, the 
connection between hard thresholding and order thresholding is similar to 
that between type I and type II censoring. The main difference being that 
the threshold parameters in type I and type II censoring (the cut-off point 
and the proportion of observations included, resp.) remain fixed, while in the 
present case they change with the sample size. As we will see, this distinction 
implies very different asymptotic behavior. 

The idea behind both statistics in (1.1) is similar to that of Neyman's 
truncation. Namely, when the "signal" is known to be concentrated in a few 
locations, the accumulation of stochastic errors has a negative impact on 
the performance of the procedure based on the chi-square statistic 

n 

(1.2) T L {n) = Y J Y l - 

i=l 

Since the signal locations are not known, the statistics in (1.1) attempt to 
minimize the accumulation of noise by focusing on the observations with 
the largest absolute values. The asymptotic theory for hard thresholding 
[Fan (1996)] requires several restrictive conditions that prevent its general 
applicability. For example, the centering and scaling of Tf/(<5 n ) in (1.1) are 
specific to the normality assumption and to the choice of 8 n . Moreover, 8 n 
is required to tend to infinity at a rate that is specific to the normality 
assumption. For example, 5 n tending to infinity is clearly not appropriate if 
the Xi have bounded support. (Below, we discuss an application to multiple 
testing where the Xi are uniformly distributed.) Intuitively, if the signal 
is present in more locations, it is advantageous to lower the value of the 
hard threshold parameter. The advantage of allowing different values of the 
threshold parameter is amply illustrated in Johnstone and Silverman (2004). 
However, the asymptotic theory of Tjj(S n ) requires the threshold parameter 
to tend to infinity at specific rates. In particular, it must be of the form 
8 n = 21og(na n ), where a n = c(logn)~ d , for c > and d > 0.5. Thus, if we 
let kni^n) denote the random number of observations considered in Tf/(5 n ), 
the asymptotic theory of Tjj{5 n ) requires E[kH(S n )] to converge to infinity 
at the rate of 

(logn) d 
A/logn + dlog^/^logra) -1 ) 
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Table 1 

Type I errors ofTH(S„) for different values of the hard threshold parameter 







S n - 2.0 


S n - 1.6 


S n - 1.2 


S n - 0.8 


S n - 0.4 




n 


= 50 


0.0003 


0.0099 


0.0231 


0.0341 


0.0431 


0.0493 


n 


= 100 


0.0101 


0.0229 


0.0324 


0.0390 


0.0461 


0.0504 


n 


= 200 


0.0231 


0.0316 


0.0382 


0.0439 


0.0484 


0.0507 


n 


= 500 


0.0327 


0.0388 


0.0422 


0.0465 


0.0502 


0.0535 



or, roughly, (logn) rf ~ ' 5 . In contrast, the asymptotic theory of T^kn) allows 
the threshold parameter k n to tend to infinity at any rate. 

While the asymptotic theory of T#(<5 n ) allows some flexibility in the choice 
of 8 n , the convergence of the distribution of T#(5 n ) to its limiting distribu- 
tion is very slow unless c = 1 and d = 2 [Fan (1996)]. The following tables 
show that small departures in the recommended value of d, while keeping 
c = 1, have significant effect on the level of the test. The results are based 
on 30,000 simulation runs. 

To fully appreciate the results reported in Table 1, we mention that for 
n = 500 the recommended £500 value is 5.1216, while the value <5soo — 2 corre- 
sponds to c = 1 and d = 2.5474. We see that even with this small departure 
from the recommended value, the achieved alpha level is 0.0327 even with 
n = 500. To contrast these results with those of Table 3 for Ti(& n ), note 
that in Table 3 with n = 500, k n ranges from 2 to 500, while in Table 1 with 
n = 500, the E[k H (5 n + h)] ranges from 38.63 for h = -2.0 to 11.81 for h = 0. 
Thus, the deterioration of the achieved alpha levels occurs as £7 in- 
creases over a relatively small range (in each case, the variance of kjj(S n ) 
is slightly smaller than its expected value). In Table 2 with n = 500, the 
E[k H (5 n + h)] ranges from 9.39 for h = 0.4 to 3.80 for h = 2.0, and for this 
range of values the type I error rate does not change much. In both tables 
with n = 500, the variance of the binomial random variable ku(S n + h) is 
slightly smaller than its expected value because P{Yi < + h) > 0.92 for 
—2.0 < h < 2.0. Finally, following a remark by the AE, we note that the 
slightly liberal a levels of the Ti,(k n ) statistic can be corrected by the use of 



Table 2 

Type I errors ofTH(5 n ) for different values of the hard threshold parameter 







8 n + 0.4 


8 n + 0.8 


8 n + 1.2 


8 n + 1.6 


6 n + 2.0 


n 


= 50 


0.0543 


0.0588 


0.0614 


0.0654 


0.0663 


n 


= 100 


0.0552 


0.0559 


0.0597 


0.0616 


0.0631 


n 


= 200 


0.0539 


0.0562 


0.0590 


0.0601 


0.0627 


n 


= 500 


0.0540 


0.0563 


0.0583 


0.0604 


0.0623 
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Table 3 

Type I errors ofTz,(k n ) for different values of the order threshold parameter 







[log 1 / 2 n] 


[log n] 


[log 3/2 n] 


[n 1/2 ] 


[n 2/3 ] 


[n 3/4 ] 


K /8 ] 


n 


n = 


50 


0.0696 


0.0685 


0.0646 


0.0646 


0.0635 


0.0630 


0.0623 


0.0626 


n = 


100 


0.0669 


0.0640 


0.0606 


0.0600 


0.0591 


0.0577 


0.0585 


0.0589 


n = 


200 


0.0667 


0.0620 


0.0603 


0.0589 


0.0582 


0.0583 


0.0555 


0.0560 


n = 


500 


0.0665 


0.0631 


0.0577 


0.0559 


0.0536 


0.0536 


0.0535 


0.0547 



a multiple of a x 2 distribution to approximate its finite sample distribution. 
Thus, using the approximation Ti(/c n ) ~ bx„, where b and v are chosen to 
match the mean and variance of Tf,(A; n ), results in the type I error rates 
shown in Table 4. 

The greater flexibility in the choice of the threshold parameter that the 
order threshold statistic offers does not come at the expense of the rate 
with which it converges to its asymptotic distribution. To emphasize this 
aspect, Figure 1 presents the estimated densities of the hard thresholding 
(solid lines in the upper panel) and order thresholding test statistics (solid 
lines in the lower panel), based on 20,000 simulated values of each statis- 
tic using n = 200. The threshold parameters of the hard thresholding and 
order thresholding test statistics have been chosen so that the average num- 
ber of observations included in the two statistics are the same in each col- 
umn. We see that the estimated densities of the order threshold statistic are 
closer to the standard normal density (dash-dot line) than those of the hard 
threshold statistic. In particular, the estimated densities in the upper panel 
show the rapid deterioration of the quality of the normal approximation to 
the distribution of Th (S200) as £200 shifts away from recommended value of 
<5 20 o = 21og(200 log" 2 200) = 3.9271. 

The remaining sections of this paper are organized as follows. In Section 
2, we represent a special form of the order statistics using data from an 
exponential distribution and briefly review the methodology of Chernoff, 
Gastwirth and Johns (1967). Section 3 develops the order threshold proce- 
dure for testing normal means in settings where the number of parameters 



Table 4 

Type I error rates using the approximation T^{k n ) ~ bxt 







[log 1/2 n] 


[log n] 


[log 3 / 2 n] 


[nV»] 


[n 2/3 ] 


[n 3/4 ] 




11 


n — 


50 


0.0565 


0.0545 


0.0531 


0.0531 


0.0532 


0.0534 


0.0540 


0.0546 


n = 


100 


0.0566 


0.0540 


0.0520 


0.0522 


0.0519 


0.0507 


0.0518 


0.0520 


n = 


200 


0.0555 


0.0547 


0.0536 


0.0531 


0.0526 


0.0523 


0.0530 


0.0516 


n = 


500 


0.0589 


0.0556 


0.0552 


0.0530 


0.0520 


0.0521 


0.0508 


0.0505 
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Fig. 1. Top panel: estimated densities of Th (feoo) for S 2 oo = 1.842, 3.927 and 5.672. 
Bottom panel: estimated densities o/ Ti,(fc2oo) f or feoo =35, 10 and 3. 

increases with the sample size, presents simulation results comparing the 
hard thresholding, a power-enhanced version of the Simes (1986), and order 
thresholding test statistics, and gives a recommendation for choosing the 
data-driven value of the threshold parameter. Section 4 extends the order 
thresholding test procedure to the high-dimensional AN OVA setting [called 
HANOVA in Fan and Lin (1998)], presents simulation results comparing 
the power of the classical F and order threshold statistics, and gives a rec- 
ommendation for a data-driven choice of the order threshold parameter. A 
discussion summarizing the developments is given in Section 5. Finally, the 
condensed proofs are given in the Appendix. For detailed proofs, see the 
archived supplemental material in Kim and Akritas (2010). This is part of 
the Ph.D. dissertation of the first author. 

2. From order statistics to order thresholding: An overview. In the late 
1960s when the asymptotic theory of linear combinations of order statis- 
tics (L-statistics) was developed [cf. Bickel (1967), Chernoff, Gastwirth and 
Johns (1967), Shorack (1969), Stigler (1969)] the main emphasis was in the 
estimation of the location parameter. Therefore, the conditions in these pa- 
pers do not yield automatically the asymptotic distribution of L-statistics 
that assign positive weight to only the largest order statistics. Such L- 
statistics were considered by Nagaraja (1982) in his study of the selection dif- 
ferential for applications to outlier detection. Using results from Hall (1978) 
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and Stigler (1973), he obtained the asymptotic distribution in the extreme 
and quantile cases, respectively. Here, we will use the conditions from the pa- 
per of Chernoff, Gastwirth and Johns (1967), CGJ1967 from now on. Their 
approach is based on a special representation of the order statistics from the 
exponential distribution, which we now review. 

Let Vi,...,V n be i.i.d. from the standard exponential distribution, let 
Vi tn < ■ ■ ■ < V n ,n be the corresponding order statistics, and consider the order 
threshold statistic 

n n 

(2.1) T E ,L(k n ) = ^2c in Vi, n = Yl 

1=1 2=71 — fcji + 1 

The method of CGJ1967 for establishing the asymptotic distribution of 
TE,L(k n ) rests on the following well-known property [cf. David and Nagaraja 
(2003), pages 17 and 18]. 

Lemma 2.1. The vector of order statistics (Vi >n , . . . ,V niTl ) may be rep- 
resented in distribution by 



(V 1 , n ,...,V n , n ) = (Y 1 ,...,Y n ), 



where 



J ,_H + _^ + ... + _!^_j : 

n n—1 n — i + 1 ^-^ 



j 



n—i+1 ^ n—i+1 
i=i 

Thus, with TE t L(k n ) given by (2.1), it can be represented in distribution as 

n 

(2-2) T E , L (k n )=J2 

where a E ,j n (k n ) = K/ (n- j + 1) for j <n-k n and a E ,jn(k n ) = 1 for j > 
n k n . 

Relation (2.2) expresses TE t L(k n ) as a linear combination of the indepen- 
dent random variables V\ , . . . , V n which enables the use of standard asymp- 
totic results for establishing conditions for its asymptotic distribution. This 
is given, without proof, in the following. 

Theorem 2.1. Let k n ,n>l, be any sequence of integers which satisfies 
k n — > oo, as n — > oo, and k n < n, and let TE,h[kn) be given in (2.1). Then 
we have 

/no , ^ „ \ T E , L (k n) / ji— 1 CVe in\^n) d ., /n lN 

(2-3) T EL (kn) = = == >-JV(0,l) asn^oo. 
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In the case where the observations Yi, i = 1, . . . ,n, come from a distri- 
bution function F, the CGJ1967 approach for obtaining the asymptotic 
distribution of the order threshold statistic 

n n 

(2.4) T F , L (k n ) = ^c in Yi, n = Yl 

i=l i=n—k n +l 

where Yi >n < ••• < Y n ^ n are the ordered Y^s, is based on the expression 
Y^ n = Hp(Vi )n ), where H F = F -1 o G, and G is the standard exponential 
distribution function, and the use of Taylor expansion to obtain: 

Lemma 2.2 [Chernoff, Gastwirth and Johns (1967)]. Let T F ^ L (k n ) be 
given by (2.4)- Then, 

n' 1 T FL (k n ) = HF,n(k n ) + QF,n{k n ) + RF,n{K), 

where 

1 - 

HF,n(k n ) = —y^^CinHpfan), 
i=l 

1 n 

QF,n(k n ) = -)U F i n (k n )(Vi - 1) 
i=l 

and 

1 n 

RF,n{k n ) = - y^Ci n {(H F (Vi n ) - H F (D in )) - (Vi n - Vin)H' F (u in )} 
i=l 

with a F)in (k n ) = YTj=i CjnH' F (i>j n ) and D in = Y?j=i n -)+i • 

They then provide conditions under which Q F , n (k n ) is asymptotically 
normally distributed and the remainder term, R F , n (k n ), tends to zero in 
probability. 

3. Single sequence of iV(0, 1) random variables. In this section, we will 
apply the approach of CGJ1967 to develop order threshold test procedures 
for testing the simple hypothesis 

(3.1) Ho:0i=Q Vz versus H a : Hq is false 

based on a sequence of observations Xi,i = 1, . . . ,n, where X, ~ N(6i, 1). 
The asymptotic null distribution of the order threshold statistic given by 
(1.1) is derived in the next subsection, while simulation results comparing 
the power of the hard threshold statistic, a power-enhanced version of the 
Simes (1986) statistic, and that of order threshold statistics are presented in 
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Section 3.2. The simulation results suggest that choosing the order threshold 
parameter equal to the number of the false null hypotheses maximizes the 
power. Section 3.3 presents a recommendation for a data-driven choice of 
the order threshold parameter using the idea of Storey (2002, 2003). 

3.1. The asymptotic null distribution. Let Xj, i = 1, . . . ,n, be standard 
normal random variables, and let 

n n 
(3-2) T^^kn) = ^ ^ Ci n Yj^ n = ^ ^i,rM 

i=l i=n— fcn+1 

where Yi = Xf, Y\. n < • • • < Y n>n are the ordered Y^s, Ci n = I(i > n — k n ), 
and k n is the order threshold parameter. The approach of CGJ1967 is based 
on the representation 

n 

T L {h n )± ^{V^), 

i=n— fc n +l 

where Vi tn , i = 1, .. . ,n, are the ordered observations from an i.i.d. sequence 
of Exp(l) random variables, and 

H(v) = F~ 1 oG(v) 
with F{y) = -^/(fr^e-t/a^y > 0) and g{v) = l-e~\v>Q. Let 



1 n i n 

(3.3) Hn(K) = - y^c in H(D in ), o-n{K) = -y^af n (k n ) 

i=l i=l 

where 

1 n 



n 

j=i 

(3.4) 



'in. 



v — !— . 

^ re - j + 1 

.7=1 



The term of a in (k n ) can be re-expressed as a in (k n ) = n _] +1 YJj=n-k n +i H'{v jr 
for i<n-k n and a in (k n ) = n _) +1 YJj=i H for i > n-fc„ with H'(Dj n ) = 
— — — r^— and the function f is the derivative of F. With this notation 

/(F -l(l_ e >)) J 

we have the following. 

Theorem 3.1. Let Yi, i = 1, . . . ,re, be a sequence of i.i.d. random vari- 
ables having the central chi-squared distribution with 1 degree of freedom. Let 
k n ,n > 1, be any sequence of integers which satisfies k n — > oo, as n — > oo, and 
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k n < n. Let ^. n {k n ) and a^(k n ) be as in (3.3) with Ci n = I(i > n — k n ), and 
let Tf,(/c n ) be given in (3.2). Then we have 

to c\ rr*a \ L\k n ) — nfj, n \k n ) d ,w„ -.x 

(3.5) T L [k n ) = — — — - >N(0,1) asn-^oo. 

Vna n (k n ) 

Note that the asymptotic mean of Tx(/c n ) is nfi n (k n ) and the asymptotic 
variance of Ti,(k n ) is na^(k n ) as k n tends to infinity with n. 

3.2. Simulations. In this subsection, we compare the empirical power of 
the order threshold statistic using several values of the threshold parameter 
with those of the hard threshold and a power-enhanced version of Simes 
(1986) statistics. The original Simes multiple testing procedure rejects the 

global hypothesis, Hq, that all Hq :6i = 0, i = 1, . . . ,n, are true if 

T s = min {nPu)/i} < a, 

l<i<n 

where Pm < • • • < Pr n \ are the ordered p- values of the individual hypotheses, 
and a is the desired level of significance. A power-enhanced version of the 
original Simes test procedure uses a/(l — /n) instead of a, where A^ pt 
is the number of false null hypotheses. 

The simulations reported here use samples of size n = 500 generated from 
the normal distribution with variance 1. The threshold parameter fcsoo of 
the order threshold statistics takes values of 15, 40, 70, 100, 200, 500, as 
well as a data-driven value, denoted by k^Q , whose description is given in 
Section 3.3. The empirical power using the approximation Ix^qq) ~ bx„ 
is reported together with that using the normal approximation to Tl^qq). 
The hard threshold statistic we consider uses the recommended value of the 
threshold parameter which is (5soo = 21og(5001og -2 500) = 5.1216. All results 
are based on 3000 simulation runs. Since the the global hypothesis Hq is 
the same for all three simulation settings, the type I error rates reported in 
the last row of Table 5 pertain also to Tables 6 and 7. Note that all achieved 
significance levels are below 0.06. The alternatives considered have 30 of the 
500 mean values different from zero. In particular, we consider the following 
sequence of alternatives indexed by r: 

H r : 9j = Tfj+T-x for j = 1, ... , 500, r = 1, . . . , 30, 

where m, j = 1,2, ... , is a given sequence. The following are examples with 
different values of rj. 

Example 3.1. We generate the values of rjj, j = 1, . . . ,30, from N(1.5, 1). 
The rest values of rjj are 0. The values different from are as follows: 

(1.0674, -0.1656, 1.6253, 1.7877, 0.3535, 2.6909, 2.6892, 
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Table 5 

Power calculations in Example 3.1 



klU Ts TH(5.122)Ti(fe°g*) b X lT L (15)T L (40)T L (70)T L (100)T L (200)T L (500) 



Hi 


30 


0.843 


0.944 


0.977 


0.975 


0.976 


0.973 


0.968 


0.960 


0.938 


0.913 


H 3 


28 


0.845 


0.942 


0.978 


0.975 


0.976 


0.975 


0.969 


0.961 


0.937 


0.910 


H B 


26 


0.840 


0.926 


0.972 


0.970 


0.971 


0.966 


0.956 


0.943 


0.911 


0.879 


H- 


24 


0.796 


0.893 


0.950 


0.948 


0.949 


0.942 


0.929 


0.915 


0.880 


0.851 


H 8 


23 


0.777 


0.845 


0.933 


0.928 


0.932 


0.915 


0.891 


0.875 


0.818 


0.775 


Hio 


21 


0.764 


0.817 


0.908 


0.900 


0.907 


0.891 


0.868 


0.841 


0.785 


0.744 




20 


0.766 


0.792 


0.905 


0.899 


0.906 


0.883 


0.853 


0.832 


0.764 


0.712 


Hl2 


19 


0.764 


0.783 


0.903 


0.897 


0.903 


0.873 


0.841 


0.812 


0.751 


0.709 


His 


18 


0.750 


0.752 


0.881 


0.875 


0.880 


0.845 


0.804 


0.776 


0.709 


0.662 


H14 


17 


0.739 


0.734 


0.864 


0.858 


0.869 


0.836 


0.789 


0.760 


0.694 


0.649 




16 


0.559 


0.574 


0.724 


0.707 


0.723 


0.671 


0.633 


0.608 


0.541 


0.495 


Hie 


15 


0.526 


0.564 


0.707 


0.693 


0.707 


0.660 


0.611 


0.574 


0.517 


0.484 


Hn 


14 


0.532 


0.529 


0.675 


0.661 


0.677 


0.625 


0.574 


0.542 


0.467 


0.432 


His 


13 


0.464 


0.435 


0.584 


0.568 


0.590 


0.534 


0.496 


0.458 


0.404 


0.373 


Hig 


12 


0.483 


0.402 


0.570 


0.556 


0.574 


0.500 


0.459 


0.427 


0.374 


0.347 


H20 


11 


0.470 


0.380 


0.547 


0.533 


0.551 


0.475 


0.425 


0.395 


0.343 


0.308 


H21 


10 


0.467 


0.390 


0.555 


0.540 


0.559 


0.490 


0.433 


0.402 


0.341 


0.319 


H22 





0.460 


0.364 


0.534 


0.515 


0.535 


0.454 


0.402 


0.368 


0.313 


0.281 


H23 


8 


0.460 


0.362 


0.517 


0.503 


0.522 


0.447 


0.389 


0.351 


0.301 


0.279 


H24 


7 


0.417 


0.290 


0.450 


0.434 


0.455 


0.375 


0.318 


0.288 


0.248 


0.230 


H§ 





0.052 


0.050 


0.059 


0.057 


0.057 


0.054 


0.052 


0.051 


0.052 


0.055 



1.4624, 1.8273, 1.6746, 1.3133, 2.2258, 0.9117, 3.6832, 

1.3636, 1.6139, 2.5668, 1.5593, 1.4044, 0.6677, 1.7944, 0.1638, 

2.2143, 3.1236, 0.8082, 2.7540, -0.0937, 0.0590, 2.0711, 2.3579). 

Note that #(j : < \r)j\ < 1, j = 1, 2, . . .) = 8, #(j:K \rij\ < 2,j = 1,2,...) = 
12, #(j:2<\ Vj \ <3,j = l,2,...) = 8, and #(j : \ Vj \ > 3,j = 1, 2, . . .) = 2. 

Example 3.2. We generate the values of rjj, j = 1,...,30, from the 
standard exponential distribution. The remaining values of rjj are 0. The 
values different from are as follows: 

(0.0512, 1.4647, 0.4995, 0.7216, 0.1151, 0.2716, 0.7842, 

3.7876, 0.1967, 0.8103, 0.4854, 0.2332, 0.5814, 0.3035, 

1.7357, 0.9021, 0.0667, 0.0867, 0.8909, 0.1124, 2.8491, 1.0416, 

0.2068, 2.6191, 1.9740, 1.5957, 1.6158, 0.5045, 1.3012, 1.6153). 

Note that #(j :0 < r]j < = 1,2, . . .) = 19, #(j : 1 < ly < 2, j = 1,2, . . .) = 
8, #(j:2 < rjj < 3, j = 1,2, . . .) = 2, and #(j: Vj > 3, j = 1,2, . . .) = 1. 
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Table 6 

Power calculations in Example 3.2 



klU Ts TH(5.122)Ti(fc°g*) b X lT L (15)T L (40)T L (70)T L (100)T L (200)T L (500) 



Hi 


30 


0.650 


0.574 


0.759 


0.745 


0.760 


0.699 


0.651 


0.608 


0.548 


0.513 




2!) 


0.680 


0.584 


0.755 


0.741 


0.761 


0.700 


0.649 


0.612 


0.544 


0.504 


J J 6 


28 


0.652 


0.565 


0.745 


0.729 


0.747 


0.684 


0.640 


0.602 


0.540 


0.498 


H e 


25 


0.666 


0.549 


0.728 


0.717 


0.732 


0.667 


0.625 


0.591 


0.521 


0.479 


H T 


24 


0.677 


0.562 


0.743 


0.729 


0.745 


0.686 


0.632 


0.591 


0.522 


0.482 


H 8 


23 


0.666 


0.536 


0.716 


0.703 


0.724 


0.657 


0.612 


0.569 


0.508 


0.478 


Hio 


21 


0.340 


0.350 


0.449 


0.434 


0.445 


0.418 


0.394 


0.367 


0.333 


0.317 


H\2 


19 


0.351 


0.342 


0.444 


0.426 


0.443 


0.410 


0.383 


0.362 


0.341 


0.305 


Hl3 


18 


0.342 


0.330 


0.456 


0.442 


0.450 


0.416 


0.388 


0.367 


0.335 


0.316 


Hii 


17 


0.350 


0.331 


0.448 


0.432 


0.451 


0.412 


0.377 


0.363 


0.325 


0.300 


His 


16 


0.337 


0.334 


0.432 


0.416 


0.431 


0.402 


0.375 


0.356 


0.327 


0.307 


Hie 


15 


0.330 


0.294 


0.406 


0.393 


0.403 


0.371 


0.338 


0.319 


0.293 


0.274 


Hir 


14 


0.357 


0.282 


0.399 


0.387 


0.403 


0.352 


0.323 


0.305 


0.267 


0.252 


His 


13 


0.325 


0.290 


0.393 


0.378 


0.390 


0.358 


0.329 


0.312 


0.276 


0.261 


Hig 


12 


0.337 


0.296 


0.413 


0.396 


0.412 


0.368 


0.337 


0.314 


0.277 


0.255 


H20 


11 


0.343 


0.291 


0.399 


0.383 


0.399 


0.349 


0.314 


0.296 


0.270 


0.250 


H21 


10 


0.346 


0.290 


0.405 


0.391 


0.404 


0.356 


0.321 


0.306 


0.268 


0.248 


H22 


9 


0.224 


0.198 


0.264 


0.251 


0.262 


0.237 


0.220 


0.208 


0.195 


0.189 


H23 


8 


0.196 


0.190 


0.257 


0.242 


0.253 


0.228 


0.216 


0.197 


0.191 


0.182 


H24 


7 


0.207 


0.182 


0.256 


0.245 


0.253 


0.225 


0.212 


0.200 


0.186 


0.179 



Example 3.3. In this example, the values of r]j, j = 1, ... ,30, are 2.0 
and the rest are zero. 

As expected, the power in each column decreases by increasing r because 
the number of 6 with values different from zero (denoted by A^qq) decreases. 
When the 6% with the large value such as 3.6832, 3.1236 (in Example 3.1) and 
3.7876 (in Example 3.2) is excluded at the alternative, the large decrement 
in the power occurs. For each alternative, the statistic 7^(15) or Ti(40) 
achieves better power than the order threshold statistics with the other 
specified values of the threshold parameter. This is a consequence of the 
fact that the number of mean values that are different from zero never 
exceeds 30. Thus, less noise is incorporated in T^A^qq) than the other order 
threshold statistics. Note that with the chosen value of J500 = 5.1216, the 
hard threshold statistic uses, on average, 12 observations. Thus, it is rather 
surprising that the empirical power of the hard threshold statistic is always 
smaller than that of Tj,(15). In all three tables, the empirical power using the 
approximation TlQ^q) ~ $ is similar to that of T^A^qq), and always greater 
than the empirical powers of the hard threshold and Simes statistics. The 
empirical power using the approximation T/^A^qq) ~ bx„ is a little bit smaller 
than that using the normal approximation, however, it is still greater than 
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Table 7 

Power calculations in Example 3.3 



klU Ts TH(5.122)Ti(fe°g*) b X lT L (15)T L (40)T L (70)T L (100)T L (200)T L (500) 



#1 


30 


0.674 


0.959 


0.973 


0.970 


0.969 


0.976 


0.982 


0.981 


0.970 


0.962 


H 3 


28 


0.643 


0.954 


0.966 


0.960 


0.955 


0.974 


0.974 


0.973 


0.960 


0.947 


Ha 

J J 4 


27 


0.617 


0.935 


0.957 


0.954 


0.945 


0.965 


0.963 


0.961 


0.950 


0.934 


H 6 


2.") 


0.598 


0.900 


0.936 


0.931 


0.926 


0.947 


0.943 


0.941 


0.922 


0.903 


H 8 


23 


0.566 


0.872 


0.912 


0.905 


0.902 


0.920 


0.917 


0.911 


0.891 


0.865 


Hw 


21 


0.529 


0.831 


0.877 


0.869 


0.862 


0.889 


0.886 


0.875 


0.849 


0.817 




19 


0.509 


0.777 


0.837 


0.828 


0.821 


0.843 


0.833 


0.821 


0.786 


0.753 




18 


0.481 


0.740 


0.816 


0.803 


0.802 


0.813 


0.803 


0.785 


0.738 


0.703 




17 


0.472 


0.715 


0.785 


0.773 


0.772 


0.784 


0.773 


0.763 


0.710 


0.671 




16 


0.448 


0.674 


0.748 


0.732 


0.736 


0.749 


0.735 


0.715 


0.669 


0.633 




15 


0.418 


0.630 


0.715 


0.700 


0.702 


0.706 


0.686 


0.668 


0.624 


0.585 




11 


0.393 


0.569 


0.658 


0.645 


0.645 


0.646 


0.629 


0.610 


0.562 


0.523 


His 


13 


0.368 


0.522 


0.629 


0.616 


0.623 


0.620 


0.597 


0.573 


0.523 


0.489 


Hig 


12 


0.341 


0.498 


0.593 


0.577 


0.582 


0.582 


0.552 


0.525 


0.486 


0.451 


H20 


11 


0.328 


0.441 


0.539 


0.527 


0.539 


0.519 


0.491 


0.472 


0.436 


0.407 


H21 


10 


0.306 


0.390 


0.487 


0.470 


0.480 


0.464 


0.436 


0.421 


0.382 


0.353 


H22 





0.285 


0.354 


0.439 


0.423 


0.438 


0.422 


0.393 


0.379 


0.344 


0.317 


H23 


8 


0.260 


0.298 


0.393 


0.374 


0.386 


0.367 


0.342 


0.318 


0.292 


0.276 


H24 


7 


0.245 


0.265 


0.349 


0.333 


0.346 


0.315 


0.296 


0.283 


0.255 


0.236 


H25 


6 


0.221 


0.224 


0.300 


0.286 


0.295 


0.272 


0.257 


0.245 


0.228 


0.213 



the empirical powers of the hard threshold and Simes statistics. In Table 7, 
for large number of the false null hypotheses the Simes statistic T$ performs 
much worse than the hard threshold statistic, the order threshold statistic, 
and even the chi-square statistic 7l(500). In all three tables, the power 
of T#(5.1216) is similar (though somewhat smaller) to that of Ti(100). 
Finally, all order threshold statistics achieved higher power than the chi- 
square statistic Tl(500). 

3.3. Choosing k n . The simulation results and the discussion in the clos- 
ing paragraph of Section 3.2 suggest that the power of Ti(k n ) is largest 
when k n equals the number of mean values different from zero (denoted by 
fcn pt )- As a data-driven choice of k n , we propose to use the estimate of k n pt 
suggested by Storey (2002, 2003) and Efron et al. (2001), which is 

C t (A) = max{ ^ (A i ) _-; A - 1 ,log^n}, 

where G n is the empirical cdf of P n = (Pi, . . . , P n ), the Pj's are the p- values 
of the individual hypotheses, and A is the median of the p's. The recom- 
mended lower bound log 3 ^ 2 n of k n pt (X) was found to be preferable in the 
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simulations we performed. Interestingly, log 3//2 n equals the expected num- 
ber of observations in hard thresholding with the recommended threshold 
parameter of 5 n = 2 log (n log" 2 n). 

4. One-way HANOVA. Let the Xij, i = 1, . . . , a, j = 1, . . . ,n, be indepen- 
dent N(9i,a 2 ), where the 9i and a 2 are all unknown. Let aj = 9i — 9 denote 
the "effect" of the ith group, and consider testing Hq : a\ = ■ ■ ■ = a a = vs. 
H a : Hq is false. Akritas and Papadatos (2004) show that the asymptotic 
power of the optimal invariant ANOVA F test equals its level of significance 
even when ||o:|| — > oo, as a—)- oo, with ||a;|| 2 = o{yfa). Because the power of 
the chi-square statistic (1.2) has a similar property [Fan (1996)], an exten- 
sion of the order thresholding to the one-way HANOVA setting is expected 
to result in similar gains in power over the ANOVA F test. 

In Section 4.1, we extend the applicability of order thresholding to the 
one-way HANOVA context, while Section 4.2 illustrates the improved power 
of order thresholding via simulation. Finally, using the idea of Storey (2002, 
2003) and the simulation results, we present a recommendation for a data- 
driven choice of the order threshold parameter in Section 4.3. 

4.1. Order thresholding in one-way HANOVA. The classical F statistic 
is given by 

(4.1) 



a MSE : 
where 



MST = — -J>CXi. -X.) 2 , MSE = ^— J^OXtf - Xi) 2 

i=l i=l j=l 

with x t . = n~ x Y?j=i x a > 3r» = N ~ x E"=i EJ=i x v > and N = an - Note that 

differs from the chi-square statistic (1.2) only in that the random variables 
which are being summed are not independent, and their distribution is not 
Xl Set 

— , , &i 



Thus, 



Vmse o- 



~ ^ a 
Z; = sZj where s 
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Threshold versions of (4.2) are of the form 

a a 
(4-3) f L {k a ) = Y^C la Zl a = S^CiaZl, 

i=l i=l 

where Z\ a < ■ ■ ■ < Z 2 a are the ordered Z 2, s, Z^ a = sZi^ a , Ci a = I(i > a — k a ), 
and k a is the order threshold parameter. For suitable centering and scaling 
constants, ju a (A; a ) and a a (k a ), the asymptotic theory of Ti{k a ) will use the 
decomposition 

f L {k a ) - aMka) _ 2 1 ^ _ n?l (k A 

(4-4) 

3 -£„(fca)( S 2 -l). 



The two components in (4.4) are independent, so it suffices to show the 
asymptotic normality of each one separately. To deal with the first compo- 
nent, let 6q denote the common value of the 6i under Hq and write 

(4.5) % = Z, + 4= where Z, = - »> and , = - ^^2> . 

Our approach for obtaining the asymptotic distribution of X^=i °ia^ia * s t° 
first derive its asymptotic distribution treating the t in (4.5) as fixed, and 
then to show that the convergence is uniform over all values of t bounded by 
any positive constant M. By Slutsky's theorem, the asymptotic distribution 
of the first component of (4.4) is the same as that of Yli=i c ia^i a - The 
asymptotic distribution of the second component of (4.4) is easily derived 
since 

y/ais 2 - 1) A N\ 0, 2 ) asa-^oo, 
V n-lj 

and, as it will be shown in lemmas described in Section 4.1.3, fiaika) /a a (k a ) — > 
Hr/&r, provided that k a /a — > r for some < r < 1 and k a — > oo, as a — > oo. 

4.1.1. Asymptotic distribution when t is fixed. When t is fixed, we set 

t a 

(4.6) Z t)i = Zi + — , i = l,...,a, T t L (k a ) = ^2c ia Z 2 (i) , 

where Z 2 ^ < • • • < Z 2 ^ are the order statistics of Z 2 1} . . . , Z 2 a . [Note that 

for t as defined in (4.5), Ztj becomes Z{\ It follows that the Zf i are inde- 
pendent x[(t 2 /a) so that their density and cumulative distribution functions 
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are given by 

e -l/2(^/a) y -l/2 « 

9aAV)- 2 i/ 2 ^2 2fc fc!r(fe + l/2)' y 



fc=0 

and 

00 „ /+2\k 



ry 1 / j.2 \ « 

G„,t(y) = ^ 5a, i (n)^ = ^e- i2 /( 2a ) F - iy (^-J G 2fe+1 (y), y > 0, 



fc=0 

respectively, where 



is the cumulative distribution function of X^(0). Let 

(4.7) nl{k a ) = -Y,CiaG- a l{l-e-^) and (a*^)) 2 = - ^(<4(fc.)) 2 , 

i=i i=i 

where a£,(*.) = ^ £° =l and = 3=7+1- With 

this notation we have the following lemma. 

Lemma 4.1 [Chernoff, Gastwirth and Johns (1967)]. Let Tl(k a ) and 
Mo(^o) ^ e as defined in (4-6) and (4-7), respectively. Let V±,...,V a be i.i.d. 
from Exp(l) random variables and let V\ ta < ■ ■ ■ < V a ,a be the corresponding 
order statistics. Then a~ l T t L {k a ) can be decomposed as 

a- l Tt(k a ) i ^ a (k a ) + Qi(k a ) + Rl(k a ), 

where 

(4.8) Q t a (k a ) = ±J2a t ia (k a )(V i -l) 
and 

with a\ a {k a ) = ^ c ^ 9aAG ^(i-e-^)) and » ia = Ej=i ^+T- 

Theorem 4.1. For any fixed value of t, let Z^ i; i = 1, . . . ,a, be a se- 
quence of i.i.d. random variables having the noncentral chi- squared distribu- 
tion with 1 degree of freedom and noncentrality parameter t 2 /a. Let k a , a > 1, 
be any sequence of integers which satisfies k a — > 00, as a —> 00, and k a <a. 
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Let /U*(& a ) and {o' t a {k a )) 2 be as in (4-7) with Ci a = I(i > a — k a ), and let 
Tj j {k a ) be given in (4-6). Then we have 

(4-9) T?(ka) = Tlik i~ ff a) 4 N(0, 1) as a ^oo. 

4.1.2. Uniformity of the convergence in distribution. This subsection shows 
that the distribution function of (4.9) converges to the standard normal dis- 
tribution uniformly on \t\ < M. 

Lemma 4.2. Consider the setting of Theorem 4.1. Let H a ,t be the dis- 
tribution function of y/aQ^ka) / a^ka) , where Qa(k a ) is given in (4-8), and 
let be the standard normal distribution function. Then, for any M > 0, 

sup \H a j(x) — <j?(x)| — > as a— >oo. 

—M<t<M 
— oo<a;<oo 

Lemma 4.3. Consider the setting of Theorem 4-1, and let R^ka) be as 
given in Lemma 4-1- Then, for any M > 0, 



sup 

-M<t<M 



— > I) as a — > oo. 



Lemma 4.4. Consider the setting of Theorem 4-1- Let F a ^ be the distri- 
bution function of T l L (k a ) given in (4-9) and let <1> be the standard normal 
distribution function. Then, for any M > 0, 

sup \F ajt (x) — Q(x)\ — > as a— >oo. 

~AKt<M 
— oo<x<oo 

Theorem 4.2. Let k a ,a > 1, be any sequence of integers which satisfies 
k a — > oo, as a—> oo, and k a < a. For t as defined in (4-5), let Zi, /I a (fc a ) and 
(& a {k a )) 2 be as in (4-6), (4-7), respectively. Then we have 

(A i n\ fr*n ^ Y^i=i c iaZl a — a^ a (k a ) d 

(4.10) T L (k a ) = ' >N(0,1) as a ->oo, 

y/aa a {k a ) 

where Z\ a < ■ ■ ■ < Z\ a are the ordered Zf 's and Ci a = I(i> a — k a ). 

4.1.3. Asymptotic normality of the order threshold statistics. In this sub- 
section, it is first shown that Ha(k a ) and <7*(fc a ) converge to ^° a {k a ) and 
o~®{ka), respectively, uniformly on |i| < M. This fact is then used in Theorem 
4.3 for obtaining the asymptotic normality of the order threshold statistic 
given in (4.3). 
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Lemma 4.5. Let k a ,a > 1, be any sequence of integers which satisfies 
k a — >oo, as a— >oo, and k a < a. Let (a^ka)) 2 and (a^(k a )) 2 be as in (4-7) 
with any t and fixed value of t = 0, respectively. Then, for any M > 0, 



sup 

-M<t<M 



1 







as a — > oo. 



Lemma 4.6. Let k a ,a > 1, be any sequence of integers which satisfies 
k a — > oo, as a — >■ oo, and k a < a. Let (k a ), ^a(^a) and (c°(A; a )) 2 be as in 
(4-7) with any t, fixed value of t = 0, respectively. Then, for any M > 0, 



sup 

-M<t<M 







as a — > oo. 



Lemma 4.7. Let fj,®(k a ) and {o~^{k a )) 2 be as in (4-7) with the fixed value 
of t = 0. Then, provided that k a ja^t r for some < r < 1 and k a — > oo, as 
a — > oo, we have 



and {o-° a {k a )y 



as a — > oo, 



where 



Hr= f I(t>l-r)G-i(t)dt 
Jo 



and 



° 2 r= t f I(t>l-r)L(s>l-r)(mm(t,s)-ts)dG-i(t)dG-i(s). 
Jo Jo 



Remark. If r = 1, then 



o-° a {k a ) V2 



as a — > oo. 



From Theorem 4.2 and lemmas described earlier in this subsection, we 
can obtain the following theorem. 



Theorem 4.3. Let fi^(k a ), (a^(k a )) 2 , \i r , and a 2 be as in Lemma 4-7, 
and let TL{k a ) be given in (4-3). Then, provided that k a /a —> r for some 
< r < 1 and k a — > oo, as a — > oo, we have 

/a -i -i \ rj-i / 1 \ T L (k a ) - an° a (k a ) d AT f n , . 2$ 
(4.11) T L (k a ) = ^ \ >N{ 0,1 + — — | 



a 2 (n — 1) 
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4.2. Simulations. In this subsection, we compare the performance_of the 
classical F statistic, given in (4.1), and the order threshold statistics Ti,(k a ), 
given in (4.11). 

We remark that Fan and Lin (1998) applied the thresholding methodol- 
ogy to the problem of comparing / curves with data arising from the model 
x ij(t) = fi(t) + £ ij (*)>*= 1> • • -, T J = !,-■■, rii,i = l,...,I. Their asymptotic 
theory pertains to the case where the number of curves which are compared, 
/, remains fixed, while T and the sample sizes rii tend to infinity. This prob- 
lem is fundamentally different from that considered here, and their procedure 
is not a competitor to ours. 

Figure 2 presents the estimated densities of Ti,(fc5oo) (solid line) and 
the density of the limiting normal distribution (dash-dot line). The esti- 
mated densities are based on 20,000 simulated values, using a = 500 and 
n = 3, when the threshold parameter k$oo takes the values of [a 1 / 2 ] = 22, 
[a 3 / 4 ] = 105, and [a 7 / 8 ] = 229. It can be seen that the approximation is quite 
good especially for k^oo = 105 and 229. Similar figures (not shown here) 
with different values of n suggest that the rate of convergence of the order 
threshold statistic to its limiting distribution is mainly driven by a, not n. 

The results reported in Table_8 are based on 20,000 simulation runs. As 
expected, the distributions of T[,(fc a ) converge to the normal distribution 
function and the achieved alpha levels are close to the true value of 0.05. 
Thus, the asymptotic theory of the order threshold statistics provides a 
good approximation. More exactly, when the number of groups are larger 
than 200, all order threshold statistics are robustjbr the 0.05 significance 
level. In particular, the achieved alpha level of Tl (fciooo) is 0.0507 when 
a = 1000, n = 5, and ki 000 = [a 7 / 8 ] = 421. 

From now, we compare the empirical power of Tz,(fciooo) using several 
values of the threshold parameter with that of the classical F statistic. The 
simulations use samples of size a = 1000 and n = 5 generated from the nor- 
mal distribution with variance 1. The threshold parameter /ciooo is 20, 50, 
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100, 250, 500, and 1000. All results are based on 20,000 simulation runs. 
The alternatives here have 20 of the 1000 9i values different from zero. In 
particular, we consider the following sequence of alternatives indexed by r: 

H r : 0j = r) j+r -i for j = 1, . . . , 1000, r = 1, . . . , 20, 

where r/j, j = 1,2, . . . , is a given sequence. The following are examples with 
different values of 77. 

Example 4.1. We generate the values of r]j, j = 1,...,20, from 
Uniform(— 2, 2). The remaining values of rjj are 0. The values different from 
are as follows: 

(1.8005, -1.0754,0.4274, -0.0561, 1.5652, 1.0484, 

-0.1741, -1.9260, 1.2856, -0.2212, 0.4617, 1.1677, 1.6873, 

0.9528,-1.2949, -0.3772, 1.7419, 1.6676, -0.3589, 1.5746). 

Note that #(j : < 1^1 < l,j = 1, 2, . . .) = 8 and #{j : \ Vj \ > 1, j = 1, 2, . . .) = 
12. 

Example 4.2. We generate the values of rjj, j = 1, . . . , 20, from Exp(0.7). 
The remaining values of r]j are 0. The values different from are as follows: 

(1.0949, 0.5511, 1.7587, 0.1128, 0.4033, 0.7991, 0.6868, 

0.0993, 0.6919, 1.8255, 1.1272, 2.1041, 0.3975, 

1.4730, 0.4549, 1.5015, 0.1830, 0.6865, 0.1360, 2.1458). 

Note that #(j :0 < Vj < 1, j = 1, 2, . . .) = 12, #(j : 1 < Vj < 2, j = 1, 2, . . .) = 6 
and#(j:r ?J >2,j = l,2,...) = 2. 



Table 8 

Type I errors of order threshold statistics, TL(k a ). for different values of the threshold 

parameter 







[log 1/2 a] 


[log a] 


[log 3 / 2 a] 


[aV»] 


[a 2/3 ] 


[« 3/4 ] 


W /s ] 


a 


a = 


50 and n = 3 


0.0522 


0.0551 


0.0601 


0.0601 


0.0623 


0.0635 


0.0637 


0.0669 


a = 


50 and n = 5 


0.0551 


0.0583 


0.0591 


0.0591 


0.0588 


0.0600 


0.0612 


0.0619 


a = 


100 and n = 3 


0.0506 


0.0521 


0.0561 


0.0563 


0.0594 


0.0607 


0.0617 


0.0634 


a = 


100 and n = 5 


0.0539 


0.0541 


0.0541 


0.0549 


0.0571 


0.0578 


0.0596 


0.0604 


a = 


200 and n = 3 


0.0436 


0.0440 


0.0490 


0.0497 


0.0552 


0.0571 


0.0601 


0.0597 


a = 


200 and n = 5 


0.0548 


0.0520 


0.0505 


0.0504 


0.0515 


0.0529 


0.0542 


0.0549 


a = 


500 and n = 3 


0.0436 


0.0437 


0.0452 


0.0466 


0.0515 


0.0558 


0.0593 


0.0589 


a = 


500 and n = 5 


0.0533 


0.0492 


0.0474 


0.0481 


0.0510 


0.0518 


0.0532 


0.0534 


a = 


1000 and n = 3 


0.0427 


0.0403 


0.0405 


0.0411 


0.0475 


0.0513 


0.0548 


0.0557 


a = 


1000 and n = 5 


0.0517 


0.0486 


0.0459 


0.0453 


0.0466 


0.0484 


0.0507 


0.0521 
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Table 9 









Power calculations 


in Exampl 


2 4.1 








1 opt 


-F1000 


Tt (201 

1^ \ j 


Tl (50) 


T L (100) 


T L (250) 


T L (500) 


Tl(IOOO) 


H, 


20 


0.8612 


0.9992 


0.9975 


0.9877 


0.9482 


8923 


8682 


Ho 

112 


19 


0.7887 


0.9963 


0.9889 


9685 


9000 


0.8270 


0.7978 




18 


0.7561 


0.9957 


0.9878 


9623 


0.8762 


0.7971 


0.7658 


Ha 
114 


17 


0.7505 


0.9952 


0.9848 


9588 


0.8743 


0.7924 


0.7601 


11~) 


16 


0.7541 


0.9949 


0.9841 


0.9591 


0.8801 


0.7944 


0.7633 


Hi 

J J 


15 


0.6785 


0.9901 


0.9712 


0.9275 


0.8175 


0.7238 


0.6891 


H- 


14 


0.6434 


0.9859 


0.9634 


0.9116 


0.7856 


0.6887 


0.6563 


H 8 


13 


0.6432 


0.9855 


0.9623 


0.9100 


0.7876 


0.6905 


0.6547 


Hg 


12 


0.5091 


0.9422 


0.8861 


0.8008 


0.6505 


0.5518 


0.5193 


Hio 


11 


0.4434 


0.9191 


0.8399 


0.7351 


0.5794 


0.4868 


0.4553 




10 


0.4444 


0.9191 


0.8399 


0.7355 


0.5742 


0.4855 


0.4561 




9 


0.4448 


0.9230 


0.8414 


0.7333 


0.5760 


0.4847 


0.4562 


H\z 


8 


0.3896 


0.8894 


0.7869 


0.6756 


0.5132 


0.4264 


0.4007 




7 


0.2887 


0.7710 


0.6364 


0.5169 


0.3835 


0.3185 


0.2989 




6 


0.2615 


0.7437 


0.6051 


0.4866 


0.3537 


0.2903 


0.2724 


Hie 


5 


0.2095 


0.6603 


0.5037 


0.3878 


0.2803 


0.2321 


0.2187 




4 


0.2089 


0.6560 


0.5002 


0.3869 


0.2742 


0.2319 


0.2169 


His 


3 


0.1356 


0.4002 


0.2874 


0.2250 


0.1686 


0.1482 


0.1421 


Hjg 


2 


0.0816 


0.1736 


0.1287 


0.1106 


0.0943 


0.0884 


0.0867 


H20 


1 


0.0812 


0.1743 


0.1277 


0.1095 


0.0934 


0.0880 


0.0862 



As expected, the power in each column decreases as r increases and T^(20) 
has the highest power. Since the number of Oi's that are different from zero 
does not exceed 20, Tl(20) minimizes the accumulation of noise, compared 
to the other order threshold statistics. For each alternative, the largest power 
differences between -F1000 and Tl (20) are about 0.5 (alternative H13 in Table 
9) and 0.54 (alternative H\2 in Table 10). In both tables, the power of 
Tk(1000) is similar to that of F1000 because T/^IOOO) is a standardized 
version of -Fiooo- Finally, all order threshold statistics achieved higher power 
than the classical F statistic -Fiooo- 

4.3. Choosing k a . The simulation results and the discussion in the clos- 
ing paragraph of Section 4.2 suggest that choosing k a equal to the number 
of groups with nonzero effects, ka Pt , maximizes the power. Our recommen- 
dation for the choice of the threshold parameter is based again on the idea 

of Storey (2002, 2003) for enhancing the power of Simes statistic for testing 

(i) — 

the constructed set of hypothesis testing problems Hq :9i = X.. , i = 1, . . . , a, 
where X.. is the overall sample mean. The p- value for each hypothesis is ap- 
proximated by 



P i = 2(l-$(\Z i \)), i = l,...,a, 



ORDER THRESHOLDING 



21 



Table 10 









Power calculations 


in Exampl 


s 4-.2 








1 opt 


-F1000 


Tt (20) 


Tl (50) 


T L (100) 


T L (250) 


T L (500) 


Tl(IOOO) 


" 1 


20 


0.7680 


0.9978 


0.9886 


0.9657 


0.8877 


8089 


0.7769 


11 2 


19 


0.7275 


0.9968 


0.9861 


9550 


8603 


0.7732 


0.7366 


H > 


18 


0.7241 


0.9960 


0.9842 


9533 


8563 


0.7669 


0.7330 


H. 
J J 4 


17 


0.6278 


0.9893 


0.9640 


0.9048 


0.7740 


0.6731 


0.6394 


iir, 


16 


6253 


0.9886 


0.9624 


9052 


0.7702 


0.6730 


0.6373 


Hi 

J J 


15 


0.6188 


0.9892 


0.9624 


0.9031 


0.7681 


0.6667 


6306 


#7 


11 


0.6011 


0.9872 


0.9577 


0.8891 


0.7464 


0.6462 


0.6119 


ffs 


13 


0.5871 


0.9872 


0.9519 


0.8829 


0.7369 


0.6337 


0.5982 


#g 


12 


0.5831 


0.9870 


0.9530 


0.8819 


0.7406 


0.6342 


0.5962 


Hu) 


11 


0.5614 


0.9849 


0.9467 


0.8730 


0.7151 


0.6097 


0.5750 


H u 


10 


0.4476 


0.9526 


0.8704 


0.7600 


0.5872 


0.4900 


0.4598 


H\2 


9 


0.4009 


0.9411 


0.8435 


0.7224 


0.5399 


0.4405 


0.4121 


H\z 


8 


0.2521 


0.7461 


0.5879 


0.4612 


0.3297 


0.2770 


0.2612 


Hu 


7 


0.2495 


0.7446 


0.5843 


0.4573 


0.3319 


0.2742 


0.2597 


His 


6 


0.1831 


0.6204 


0.4465 


0.3361 


0.2419 


0.2026 


0.1913 


Hie 


5 


0.1820 


0.6119 


0.4411 


0.3383 


0.2407 


0.2014 


0.1898 


H\7 


1 


0.1283 


0.4346 


0.2941 


0.2197 


0.1613 


0.1412 


0.1356 


His 


3 


0.1296 


0.4389 


0.2959 


0.2195 


0.1654 


0.1434 


0.1363 


Hig 


2 


0.1195 


0.4202 


0.2793 


0.2084 


0.1515 


0.1308 


0.1258 


H20 


1 


0.1176 


0.4207 


0.2763 


0.2041 


0.1532 


0.1296 


0.1238 



with 




where Xi. is the sample mean from the ith group and Sp 1 is the pooled 
sample variance. The power-enhanced version of the Simes statistic 

T s = min {aP^/i} 

l<i<a 

rejects the global null hypothesis if T$ <a/(l — ka Pt /a), with 

(4.12) K pt (X) = max j aG ^^ X - 1 , log 3 / 2 a}, 

where G a is the empirical cdf of P a = (Pi, . . . , P a ), P^ <■•• < P^ are the 
ordered Pj's, and A is the median of the Pj's. 

The simulation results shown in Table 11 suggest that the power of 
7l(/c°qq ) is similar to that of Tx,(/e°qq ). These results are based on 2000 
simulation runs; the type I error rate of Il(A;°qq ) was 0.048. 
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Table 11 
Power calculations in Example J^.l 







Hz 


Ha 


«4 




#6 


«7 




H<> 


Hio 


le° pt 

_ioqo 

^i(^iooo) 


20 
1.000 


19 
0.996 


18 
0.993 


17 
0.994 


16 
0.995 


15 
0.987 


14 
0.980 


13 
0.981 


12 
0.938 


11 
0.904 




Hn 


-Hi 2 






His 


H±e 


H17 


His 


Hig 


H-20 


_ioqo 

2i(fciooo) 


10 
0.911 


9 

0.900 


8 

0.883 


7 

0.774 


6 

0.722 


5 

0.674 


4 

0.661 


3 

0.398 


2 

0.182 


1 

0.175 



5. Discussion. The asymptotic theory of test statistics based on hard and 
soft thresholding pertain the normal distribution and require the threshold 
parameter to tend to infinity at a strictly prescribed rate. This second feature 
results in potentially compromised power of the hard threshold statistic. 

Order thresholding, a new thresholding method based on order statistics, 
is proposed. The asymptotic theory, developed under the normal distribution 
in this paper, allows great flexibility in the choice of the threshold parameter. 
A data-driven choice of the order threshold parameter is given. An extension 
to a one-way HANOVA setting is presented. Simulation studies with normal 
data suggest that order thresholding can have great power advantage over 
hard thresholding. Additional simulations with data generated under a one- 
way HANOVA design suggest even larger power gains over the traditional 
ANOVA F-test. 

Applications of the order thresholding approach to testing for the uniform 
distribution, and to multiple testing problems will be pursued in a follow-up 
paper. 

APPENDIX A: PROOF OF THEOREM 3.1 

The proofs of the present lemmas can be found in the archived supple- 
mental material in Kim and Akritas (2010). 

A.l. Some auxiliary results. 

Lemma A.l. Let Ui tU , i = 1, . . . ,n, be order statistics from the uniform 
distribution in (0,1), and set Vi jU = — log(l — Ui n ). For any < e < 1 and 

some 1 - log(n - yj\ log(f ) + 1)/ log n < 5(n) < 1 - log(f log(f ))/(2 log n), 



ORDER THRESHOLDING 



23 



set 



u jn (e) 



ui n {e) 



max< 0, 



n 



2n 



log 



f2/e 



n l-S(n) <j< n> 



J-l 



1 



2n 



58 \ 



1 < j < n 



1-S(n) 



n 

_ 1 - e~^ n e~V^, n 1 ' 5 ^ <j<n, 

where i)j n = Yli=i l/{n — i + l). Then, the sequences of constants 
v jn (e) = - log(l - u jn (e)), v^(e) = - log(l - v? n ( e )) 

satisfy 

(A.l) P{v jn {e) < V j>n < vi n (e), 1 < j < n} > 1 - e, n > 1. 

Lemma A. 2. Let Uj n {e) and u 3n (e) be given in Lemma A.l. Then, the 
sequences of constants Uj n (e) and u 3n {e), j = 1, . . . ,n, satisfy the relation 



Ujn{E) < 



J 



n + 1 



< u jn (e). 



Remark. Assume that 1 — n~ & ^ — > 0, as n — > oo. Then, the sequences 
of constants Uj n (e) and u jn (e), given in Lemma A.l, satisfy the relation 
sup Kj - <n (tf"(e) — Uj n (e)) = o(l) (cf. Glivenko-Cantelli theorem). 

Remark. If we take all Uj n {e) and u jn (e) from the Kolmogorov's in- 



equality, then Uj n {e) = 1 — e~^ n+ v 2 / £ and u ]n {e) = 1 — e~ Vjn ~v 2 / £ ; j 
l,...,n. Under these settings, sup 1<J<n (u jri (e) — Uj n (e)) ^ o(l). Also, the 
positive function R(j), defined in Lemma A. 4, is not increasing on 1 < j < n. 

Lemma A. 3. Let Vj n be given in Lemma A.l, and let Uj n = — log(l — 
j/(n +1)), j = 1, . . . ,n. Assume that 1 — n~ 5 ^ — > and n 1 / 2 (l — n~ 5 ^) — > 
oo, as n — » oo. Then, the sequences of constants Vj n {e) and v^ n (e), given in 
Lemma A.l, satisfy the relations 

Vjn(£) <Vj n <Vj n <vi n {e) and v jn (e) — Vj n (e) < K(e), 

where K(e) is independent of n. 

Lemma A. 4. Let v yn {e) and v yn (e), 1 < y < n, be given in Lemma A.l, 
and let H = F^ 1 o G, where F is the central chi-squared distribution func- 
tion with 1 degree of freedom and G is the standard exponential distribution 
function. Then, 
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1. H' is increasing, positive, concave, and H 1 (v) 2 , as v — >oo. 

2. H" is a decreasing positive function, and H"{v) — > 0, as v — >■ oo. 

3. H(v)H"(v) ^0, as v^oo. 

4- - e ~ v ) ->• °; «s v ^ 0, and -»• 0, as u -»• oo. 

H"[v) H"(v) 

5. Assume that 1 — n~ 5 ^ — > and n x / 2 (l — n~ 5(n )) — > oo, as n — > oo. The 
positive function 

R(y) = ( V y n (e) - v yn {e))H"(v yn (e)) I _ V 

y n y -\- 1 

is increasing on 1 < y < re 1- ^™). Moreover, for sufficiently large n, R(y) 
is also increasing on n 1_5 ( n ) <y <n. 



A. 2. Proof of Theorem 3.1. We need to check Assumptions A, B and C of 
CGJ1967. We use the original forms of Assumptions A and C (restate below 
for convenience), but a slightly stronger version of Assumption B. [Note 
that the simultaneous bounds of the exponential order statistics, Vj n (e) and 
v jn (e) used in Assumption B, are different from those in CGJ1967.] 

Assumption A: H(v) is continuously differentiable for < v < oo. 

Assumption B: For each e > 0, 



j=n-k n 



sup \H'(v) — H {p. 

Vj n (e)<v<vi n (e) 




= o(na n (k n )), 

where Vj n (e), v jn (e), and Dj n are given in Lemma A.l. 
Assumption C: m.ax.\<j< n \a.j n {k n )\ = o{n l / 2 a n (k n )). 

Assumption A is clearly satisfied. To verify Assumption C, use Lemma 
A. 4(1) to write 



maxi<j< n |qj Tt (fc Tt )| 
\fna n {k n ) 



(A.2) 



< 



E"=l«|n( fc n) 
2 



Suppose first that k n /n — > as n — > oo. Then 



{H'(v jn )} 2 > k n {H\D n ^ kn+hn )y 



oo 



as n - 



oo, 



j=n-k n +\ 
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so that (A. 2) tends to zero and Assumption C is satisfied in this case. Next, 
suppose that k n /n — > r as n — > oo , for some < r < 1. Using the approxima- 
tion (2.9) of CGJ1967, that is, i>j n ~ i/j n = — log(l — j/(n + 1)), it follows 
that 



n 



n 

j=n-k n +l 



r«0 



^\^t( ^ n ~^ n \i(l ^ )(F 

In + l > n + 1 /I V n + l) ' \n + l 



-iv 



i r h-t) ] 2 

^ I(t > 1 - r) | ^ ^ } dt>0 asn^oo, 



so that Assumption C is also satisfied. To show Assumption B, we use Lem- 
mas A. 4(1) and A. 3 to write 8up v . n r e \ <v<v j n r e \\H'(v) — H'(vj n )\ < H'(v jn (e)) - 
H'{v jn {e)) = (v^(e) - v jn {e))H''(v jn (e)), v jn (e) G (v jn (e),v^(e)). Thus, 



n(j n (k, n 



< 



1 n 

— y 



(A.3) 



j=n-k n +l 



{(v^(s)-v jn (e))H"(v jn (e))} l 



n — j + 1 



E 

=n— fen+l 



where the inequality is justified by the fact that H" is a decreasing positive 
function [Lemma A. 4(2)]. We need to prove that (A.3) tends to zero as 

n — > oo. Suppose first that k n /n — > 0, as n — > oo. Divide numerator and 

l I'l 

denominator of (A.3) by k n and consider first the numerator. Then, 



E 



{(t^"(£) -«,„(£))£>,„(£))}, 



j=n-k n +l 



n 



J + l 
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s Jnk, 



n 

= y 



j=n-kn + l 



H"(v jn (e)) 



n — j + 1 



1 



y 

n J= n-k n +l 



1 



{(^»( e )- ViB ( e ))fl"(« iB ( e ))} 



+ 








1 


+ 





E 

J=n l-<5(n) 

n 

v n k n ^ 

j=n-k n ' +1 

otherwise, 



n — j + 1 
{(«J»( e )- WiB ( e ))fl"( WiB ( e ))} 



n — j + 1 



{(v^(e)-v jn (s))H"(v jn (s))} 



n — j + 1 



with 1 - log(n - V§ log(f ) + 1)/ log n < 5(n) < 1 - log(n - c £ n 3 / 16 kn /8 + 



1 ) / log n [This range is applied only when k n > y l| log )] . Assume that 1 — 
n -<5(«) _^ o, nV 2 (l - n"^™)) oo, and n l / 2 {l - n'^f/ 2 d for some d > 

1 v^n 



0, as n -> oo. If fc n < ^§ log(f ), then ^= ^^^^"(^(e))^-^] < 

(i log(^)) 1/ ' 4 n 1 / 4 ^"(t; nn (e)) — )• 0, as n — > oo. This inequality is justified by 
Lemma A. 4(5), and the fact that n 1//4 .ff" '{v nn {e)) tends to zero. Suppose 



that k n > y f log(^) and k n /n ->■ 0, as n — > oo. Set a n = n 1 5 ^ - 1 and 
b n = n — kn^ . Using Lemmas A. 4(5) and A. 3, we have 



!nkr. 



E 



j=n-k n +l 



{(v^(e)-v jn (e))H"(v jn (e))} 



n — j + 1 



(A.4) 
(A.5) 



< 



kn 



(l_ n -«5(n)) 



— (v^(e)-v an , n (e))H"(v an , n (e)) 



1/ -n 3 / 16 ^^,^,)) + J° £ k-^H"{v nn {e)). 



Since (n — a n + l)/?! 1 / 2 — > oo as n — > oo, using a one-term Taylor expan- 
sion we have - ^ W « ^V^g^ = 0( ^ (1 J._ W .„ ). 
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Thus, we have 

/ 1 k 1/2 ~ \ 

(A - 4) = °( v n i/2(i_ re ^(n) ) 3/2 ■^TJ2 H "( V ^))J ^0 asn^oo, 

where it is justified by Lemma A. 4(2) and the fact that v an ^ n (e) tends to 
infinity. From Lemma A. 4(2), the second term of (A. 5) tends to as n — > oo. 
Moreover, the first term of (A. 5) tends to as n — > oo (even b n = n — n 1 / 4 ). 
Since also (£ £ n J=n _ kn+l {H' (v 3n )} 2 )~ l l 2 < (H'(u n - kn+1 , n ))~ l < oo, (A.3) 
tends to zero and Assumption B is satisfied when k n /n — > as n — > oo. 
Next, we suppose that for some < r < 1, k n /n — > r as n — > oo. Divide 
numerator and denominator of (A.3) by ra 1 / 2 and consider the numerator 
and denominator separately. Since 



1 - 

1 E 

n ^ 

j=n-k n +l 



{(v^(e)-v 3n (e))H"(v jn (e))} 



n — j + 1 



^„{ H"(v nl - S( n)_ 1>n (e)) \ [8 3 /i6£V/// / \ \ 



+ ^-n- 1 ' i H"{v nn {e)) 
— » as n — > oo, 

which can be obtained by breaking up the summation first for j = n — 
k n + 1 to n 1 " ^") - 1, n 1 " 5 ^ to n - n 1 / 4 , and lastly n - n 1 / 4 + lton with 

l-log(n- log(^) + l)/logn< 5(n) < 1 - log(n - n 13 / 16 + l)/logn, and 

(n Y^j= n -k n +i{H' (vjn)} 2 )' 1 ^ 2 < oo, the term (A.3) converges to as n — > oo 
in this case. Thus, Assumption B holds for both cases. Since Assumptions 
A, B and C of CGJ1967 are satisfied, the proof is done. 

APPENDIX B: PROOF OF THEOREM 4.1 

The proofs of the present lemmas can be found in the archived supple- 
mental material in Kim and Akritas (2010). 

B.l. Some auxiliary results. 

Lemma B.l. For any < e < 1 and some 5(a) which satisfies 1 — log(a — 
A /|log(f) + l)/loga < 5(a) < 1 - log(§ log(f ))/(21oga), 1 - a~ s ^ -> 0, 
a x / 2 (l - a~ s ^) — > oo, and a l / 2 (l — a"* 5 ^) 3 / 2 — > d for some d > 0, as a— >oo, 
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let 



max 



Uja{e) 



ui a {e) = < 



l-e Ja e 



1< j<o 1-tf (°), 

a l-S{a) < j < Q> 



J-l 



+ 



1 < j < a 1 " 5 ^) , 
a l-*(o) < j < fl; 



,l-e-^e'V 2 / £ , 

where i)j a = X^i=i V( a ~~ * + 1)> an ^ set 

v ja (e) = - log(l - u ja (e)), v* a (e) = - log(l - u ja (e)). 

For any M > 0, Zei H a> M{ v ) = G~ M (1 — e~ v ), where G a ^M is the noncentral 
chi-squared distribution function with 1 degree of freedom and noncentrality 
parameter M 2 /a. Then, 

1. H' a M is bounded and H' a M {v) — s- 2 ; as v — > oo and a — > oo. 
2- H'l M (v) = B aM {v) - (H' a M {v)) 2 J aM {v), where 



Ja,M(v) 



2 2H aM {v). 
T.t=i{{MVa) k (H aM {v)) k ~ 1 /(2 2k (k - l)\T{k + 1/2))} 



ET=o{(M 2 /a)HH a , M {v)) k /{2^k\T{k + 1/2))} 

Note that B a ^j is a decreasing positive function, B a ^M{ v ) as v — >■ oo 
and a— >oo, and J a ,M is bounded by M 2 /(2o). 
3. The positive function 



Rm(v) = (v ya {e) - v ya (s))B aiM (v ya (s))J 



a-y + 1 

is increasing on 1 < y < a 1-5 ^ . Moreover, for sufficiently large a, Rm(u) 
is also increasing on a 1 "* 5 ^) <y < a. 

Lemma B.2. Consider the setting of Lemma B.l. Let g a fi and g a ,M 
be the density functions of Xi(0) and x({M 2 /a), respectively. Set y a ^ = 
H afi (v aa (e)) = G-i(l-e-^( £ )) andy aM = H atM (v a a(e)) = G^l-e"^)). 
Then, 

1. y a ,M is bounded by (21og(a + 1) - 21og( v / ^72) + 21og(e M /( 2 v / «) + 

e -M/ (2>/a)))2 _ 

2- g a ,o(ya,M) 1 9a,M{ya,M) -> 1 a^^o^A/) / ' 9a,M{y a ,M) - 1) ->■ 0, as 

a — > oo. 
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3 - g< "° ( ^r^) yn,M) ~ -(^f ±1 )C C 'a ) M ; ^ere C , M is defined in the proof. 
In particular, C ayM = 0(y a ,itf)- 

4. a 1/4 (y a ,o/ya,M - 1) -)■ and a 1/4 (y aj0 (ya,o)/Sa,o(ya,M) - 1) ->• 0, as a ->■ 
oo. 

5. a 1/4 (e-^( £ )/y a , (ya,A/)-2 + 2/y ai0 )^0, as a ^oo. 

6. a 1 / 4 (e-"-( £ )/ 5o>M (y 0;M ) - 2 + 2/y a , M ) ^ 0, as oo. 

Remark. For any M > 0, we write H' aM (v aa (e)) = ^"J^,) and 
£>a M(t>aa(£)) = o — 7 \ ( — — 7 t — 2 H £ — t r • — — ). From Lem- 

a,M \ aa\ /j 2g a! M(y a ,M) v ffa,MU/a,Af) g a ,M{y a ,M) Va,M 1 

mas B.l(l), B.l(2) and B.2(6), we obtain that a l/A B aM (.v aa (e)) ->■ 0, as 
a — > oo. 

Lemma B.3. Consider the setting of Lemma B.2. Let b a = a — k]j^ with 



k a > VlMf )• Set x afi = H a>0 (v bata (e)) = G~l(l - e -«*-.-( s )) and x a , M 
ffa,JVfK,a(£)) = G~ 1 M {1 - e~ v ^^). Then, 



1. ^a,M < (21og(a + 1) - 21og(A; a /4 + 1) - 21og(7^72) + 21og(e M /( 2 v / ") + 



e -M/(Vo)))2. 



2- g a ,o(Xa,M)/ga,M( X a,M) ~> 1 and a3/16 (fl'o.O (Xo,,m) / 9a,M (x 0) Af) ~ 1) -> 0, as 

a — )• oo. 

9a,o(^a,Af) v 2 / 2a a,iw> 

Ira particular, C' a M = 0(x a ,M)- 

4. a 3/16 (x ai o/:Ea,A/ - 1) -> and a 3/16 (ya,o(^a,o)/ya,o(^a,M) - 1) -> 0, asa-> 
oo. 

5. a 3 / 16 (e-^, a (=)/« 7a ( a ; aiM ) - 2 + 2/x afi ) -> 0, as a -> oo. 

6. a 3 / 16 (e-^.«( £ )/ 5a , M (x a) M) - 2 + 2/x a , M ) -> 0, as a -> oo. 

Remark. From Lemmas B.l(l), B.l(2), and B.3(6), we obtain that 
a 3 / ie xB a M(v ,i/4 (e)) — > 0, as a — > oo. 

B.2. Proof of Theorem 4.1. For simplicity, let H a , t (v) = - e _,; ). 

Then 

j a ^ a 

a '«<« = ^m|>V t ( G i(i-e->,)) = ^TTT g 
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and 



1 



1=1 

Let us check Assumptions A, B and C of CGJ1967, which we restated in 
the proof of Theorem 3.1. For given any \t\ < M, Assumption A is clearly 
satisfied. Next, it is easily verified that for any fixed values of a and v, H' at (v) 
increases as \t\ increases. Thus, Oi\ a (k a ) and cr*(fc ) increase as \t\ increases. 
Let us check Assumption C: for given any \t\ < M, 

max 1 < j < a |a* a (/c a )| max 1 < J < a |ajf(fc )| 
sfao^ka) ^/aa Q a {k a ) 

max a _ fc „ + i<j< a H' aM {D ja ) 

— > as a — > oo, 
provided that k a — > oo, as a — > oo. It is justified by the facts that 

max H' M {vja) 

a—k a +l<3<a 

is bounded [Lemma B.l(l)] and YTj=a-k a +i^'afl{^ja)} 2 — > °o as k a tends to 
infinity with a. (It was shown in the proof of Theorem 3.1 because it becomes 
the central chi-square case when t = 0.) In order to verify Assumption B, it 
suffices to show that 



Ei=a-fc a +l [{ SU P^ a ( £ )< ^<^( £ )l g a,M(^) ~ ff a,A/(^a)l}y / i/( Q ~ J + _ , . 

z 

where Vj a (e), v ja (e), and Vj a are given in Lemma B.l. Using Lemma B.l(2), 
we write 

sup \H' a>M (y) -H' aM (v ja )\ 

v ja (e)<v<vi a {e) 

<(v^(e)-v ja (e))-\H'l M (v* a )\ 
<(v ja (e)-v 3a (e))B aM {v ja (e)) 

+ (v^(e) - v ja (e))(H' ajM (v* a )) 2 M 2 /(2a) 
with some v* a £ (f j a (e), f ja (e)). From the above inequality, we have 

J2 a j=a-k a +l [{suPv ja (e)< v<vi"(e)\K,M( v ) ~ K M {j>ja)\} y/ j / (a - j + 1)] 

«Ei=a-fc o +l{^a,0(^)} 2 
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(B.l) < 



V^Y*= a -k a+ Mv Ja {e) - v ja (e))B a , M (v ja (e))}^j/(a - j + 1)] 



+ 7r„ E 



(B.2) 



j=a-k a +l 



,M 2 



( v ^(e)- Vja (e))(H' a<M (v*J) 2 ^ 



2a J y a — j + 1 



E {Ko(^)¥ 

\ j=a-k a +l 



To show that (B.2) tends to zero, we use Lemmas B.l(l) and A. 3 to write 



k 



-1/2 



(B.2)<C e -^- {Ko(^)} 2 ) for some < C e < oo. 



\j=a— fea+1 / 

Suppose first that k a /a — > as a — > oo. Then 



(^•3) £ {^,o(^)} 2 ) 



-1/2 



< 



H'o(Va-k a + l,a] 



< OO, 



so that (B.2) tends to as a — > oo. For some < r < 1, if k a /a — > r as a — > oo, 
then 



(B.4) 



-1/2 



< oo. 



j=a-k a +l 



Thus, (B.2) tends to as a — > oo in both cases. Since (B.2) converges to 
zero, the remaining part is to prove that (B.l) tends to 0, provided that 

k a — > oo as a — > oo. Suppose first that k a < | log(^). Divide numerator 

1/2 

and denominator of (B.l) by k a and consider first the numerator. From 
Lemma B.l (3), we have 



1 



'ak 



E 



a j=a-k a +l 

< 



{(v^{e)-v ja (e))B aM {v ja {e))}, 



a- j + l 



Using Lemmas B.l(l), B.l(2), B.2(6) and (B.3), the term (B.l) tends to 
zero and Assumption B is satisfied in this case. Next, we suppose that k a > 
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§ log(^) and k a /a^0, as a —> oo. Then, from Lemmas B.l(3) and A. 3, 



iak a 



E 



j=a-k a +l 



{{v ja {e)-v ja {e))B ay M{v ja { e ))}' 



a-j + 1 



< 



(v al Ha) 1,a (e)- V-'C«)-i,o( e ))- B o,Jlf(V-«(«)-i,o( e )) 



y a(l-a~ s ( a )) 



where 1 - log(a - y'f log(f ) + l)/loga < 5(a) < 1 - log (a - c e a 3 / 16 A;a /8 + 
l)/loga. From Lemmas B.l(l), B.l(2), B.3(6), (B.3), and the fact that 
v al ~ S(a) - 1 ' a (e) - V-«C«)-i, ( e ) = °( a i/2 {l l a s {a)) ), the term (B.l) also tends 
to zero and Assumption B is satisfied in this case. Lastly, we suppose that 
for some < r < 1, A; a /a — ?■ r as a — > oo. Divide numerator and denominator 
of (B.l) by a 1 ' 2 and consider the numerator and denominator separately. 
Since (B.4) and 



1 



E 



j=a— k a +l 
< 



{(v ja (e)-v ja (e))B aM (v ja (e))}. 



a-j + 1 



K 1 Ha) 1,a (e)-v-w-i,«( £ )) B «,MK>~*w-i, a ( £ )) 



— >• as a — )• oo 



with l-log(a- Jf log(^) + l)/loga<5(a) < 1 - log (a - a 13/16 + l)/bga, 

the term (B.l) converges to as a — > oo in this case. Thus, Assumption 
B holds as k a tends to infinity with a. Since Assumptions A, B and C of 
CGJ1967 are satisfied, the proof is done. 

APPENDIX C: PROOF OF THEOREM 4.2 
C.l. Proof of Lemmas 4.2-4.4. 

C.l.l. Proof of Lemma 4-2- We observe that 
VaQl(ka) 
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where V% are i.i.d. random variables with the distribution function G(v ) = 
1 - e~ v , v>0. Note that g(v) = e~ v , v > 0, £(VS - 1) = 0, Var(^ - 1) = 1, 
and E(\Vi - 1| 3 ) = 12/e - 2. Let 

a a 

s 2 a = Var(a* a (fc a )(F 4 - 1)) = J>* a (*; a )) 2 = a(a* (k a )) 2 , 

i=l i=l 

Pi =J2E(\ata(kaW - 1)| 3 ) =(--2) 1 £2 W\a(ka)\\ 
i=l \ e / i=1 



Using Berry-Esseen theorem of Galambos [(1995), page 180] we have 
sup IS* (xs a ) — &(x)\ < 0.8r a as a oo, 

— oo<x<oo 

where 5* is a distribution function of Yli=i a L(^a)(^ ~~ 1) anc ^ ^ is a stan- 
dard normal distribution function. Thus, we have 

sup \Ha,t(x) - $(x)\ 

-M<t<M 
—oo<x<oo 

= sup |S*(xv^fT*(fc a )) -$(x)| 

-M<t<M 
— oo<x<oo 

^ n c f 12 <^ / maxi< i<a |a* a (fc a )| \ 

< 0.8 2 sup < ~ ~ > — > as a — > oo, 

V e / —M<t<M I V acr a{ k a) J 

provided that A; a — > oo, as a — >• oo. 

C.1.2. Proof of Lemma 4-3. For convenience, we rewrite 

Ri(ka) = l £ {(^,a-%a)G* (^ )}, 
j=a-fc a +l 



where 

Ha,t(w) - H att (D ja ) 



H' a ,t(Pja), tfv^V ja , 



0, if V = Uja- 

Let fja( £ ) = sup„ ja(e)<1)< ^a (e) |G* a (i;)|, where u io (e) and v ]a (e) are given in 
Lemma B.l. Then, we have 



(C.l) pl\Ri{k a )\<- a J2 sfalfWi 
V. j=a-k a +l 



a - v ja \ for all |i| < M \ > 1 - e. 
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It follows from (C.l) that 



f 1 a 

P{ sup \Rl(k a )\<- Y, 
\-M<t<M Vft + i 



gf a {e)\V j>a -i> ja \\>l-e. 



From Assumption B and Proposition 2 of CGJ1967, we have 
Ej= a - ka +i9ji( £ ) x \Vj,a ~ Vja\ = o p (^Ea^(k a )), so that 
sup_ M<t<M \^/aR t a (k a )\ = o p (a^ I (k a )). Also, it is easily verified that 
a^ 1 (k a )/ a®{k a ) = 0(1) (Lemma 4.5), provided that k a — > oo as a — > oo. Con- 
sequently, 



sup 

-M<t<M 



O-a(Ka) -M<t<M ^ a [k a ) 



C.l. 3. Proof of Lemma 4-4- We have already proved that for given any 
\t\ <M, 



Tl (k a ) 



+ ^/aRJM 4jv( 01 ) asa^oo (Theorem 4.1) 

provided that k a — > oo, as a — > oo. From Lemmas 4.2 and 4.3, we have 
sup \F a j(x) — &(x)\ — > as a— >oo, 

— M<t<M 

— oo<x<oo 

provided that k a — > oo , as a — > oo . 

C.2. Proof of Theorem 4.2. For any given 5\ > 0, there exists M > 
such that 

P(\t\>M)<5 1 . 
From Lemma 4.4, any given 82 > 0, there exists ao such that 

\P{f* L {k a ) < x\\t\ < M) - < 5 2 for all a > a . 

Thus, we have 

\P{ft(k a ) < x) - $(x)\ < \P(f* L (k a ) < x\\t\ < M) - 

+ P(\t\>M)<8 1 + 5 2 
for all a > ao- Take e/2 = max{(5i, #2}. Then 

\P(fl(k a )<x)-^(x)\<s foralla>a . 
Thus, provided that k a — > 00, as a — > 00, we have 

?£(fc o )4-iV(0,l) as a ^00. 
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APPENDIX D: PROOF OF THEOREM 4.3 
D.l. Proof of Lemmas 4.5 4.7. 

D.l.l. Proof of Lemma 4-5. We need to show that 



sup 

-M<t<M 



0", 



0", 



a\ n a 



a\ n a 



as a — > oo, 



provided that k a — > oo as a — > oo. Suppose first that k a /a — > as a — > 
oo. Since \/a/k a a^(k a ) > 0, it is enough to show that sj a/k a (a^ \k a ) — 
°a{k a )) -»• 0, as a ^ oo. We first have a(a^(k a )) 2 < (k a (2 - 
k a /a)){m&x a _ ka+1 <j< a H' a M (t>ja)} 2 and a(a° a (k a )) 2 > (k a [l + k a (a- k a )/((a + 
i)(k a + i))]){K o(^a-fc a +i,a)} 2 - Consequently, we obtain 



— V 2 — ^ I t D K$^ ^,A/(^a)} 

V a la— ft a +l<j<a J 



-4/1 + 



(a + l)(fc a + l) 



)}^0 



as a — > oo. Next, we suppose that for some < r < 1, /c a /a — >• r as a — > oo. 
Then a^(k a ) > 0, so we need to prove that (k a ) — o~®(k a ) — > 0, as a — > oo. 
We observe that 



{a^{k a )f - (a° a (k a )) 2 

a-k a / 1 x 2 



1 



a * — ' \ a — i + 1 

i=l 



E KmO 

\j=a-k a +l 



30.) 



E Ko(*. 

\j=a-k a +l 



.I'll 



+ 



a 

7, £ 



i=a— fca+1 



^TiE^m(%«)) 

j=i / 



max M {vja) - H' o(u ja )) 

a—k a +±<j<a 



as a — >• oo. 
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Since (af {k a )f - (a° a (k a )f = (a A J(k a ) + a° a (k a ))(a A J (k a ) -a° a (k a )), we have 
a ^(k a ) ~ v° a (k a ) -> asa-^oo. 

D.1.2. Proof of Lemma 4-6. We hope to show that 

VaMka) ~ Va(ka)) 



sup 

-M<t<M 



> U as a — > oo, 



provided that k a — > oo, as a — > oo. From the fact that G~ M (1 — e~ Uia 
G~J(1 — e~ Uia ) is increasing in i and Taylor's expansion, we have 



i=a— k a +l 



Note that the last equality is justified by the similar argument of the proof of 
Lemma B.2(3). Applying the same argument of the proof of Lemma B.2(l), 
it follows that 

Galii 1 -^) < (21og(a + l)-21og(^72) + 21og(e M /( 2 ^) + e - Af /( 2 ^))) 2 . 



Suppose first that k a /a — > as a — > oo. Since a/k a a^(k a ) > 0, it is enough 
to show that 

(D.l) (/^(*a)-A«2(*a))->>0 asa^oo. 

V "-a 

Since y ■ a ' M ^ > as a — > oo, (D.l) is satisfied. Next, we suppose 

that for some < r < 1, k a /a ->r as a-> oo. Then, a^{k a ) > 0, so we need 
to prove that 

(D.2) v^(/4 f (M-/^a))^0 asa^oo. 

j, G _1 (1— e _s<la ) 

Since ^ • a,M y= y0asa->oo, (D.2) is also satisfied. 

D.l. 3. Proof of Lemma 4-7. Suppose that k a /a — > r, < r < 1, and k a — >• 
oo, as a oo. Then 

i=a-fc a +l i=l v 7 V 1 / 

H /'OO 

— >■ / L{t > \ — r)G~\(t) dt = I ug a o(u)du as a — >-oo. 

i0 ^G-^l-r) 
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Note that if r = 1, p? a (k a ) — > 1, asa-> oo. Also, we have 

(a° a (k a )) 2 * \yy\i( > a -^]i( J- > 



j=l 1=1 



X 



a+l) \ a + 1, 

min{j/(a + l),Z/(a + l)} 
l-min{j/(a + l),Z/(a + l)} 

1 1 

X ffa,o(G-J(i/(a + 1))) g a ,o(G-i(l/{a + 1))) 

/ / I(i> l-r)J(s> l-r)(min(i,s) -is) 
Jo io 

1 1 

dt ds 



= f [ I(t>l-r)I(s>l-r)(min(t,s)-ts)dG-l(t)dG-i(s). 
Jo Jo 

Note that if r = 1, <7a(fca) — >• \/2, as o — > oo. 

D.2. Proof of Theorem 4.3. From Theorem 4.2, Lemmas 4.5, 4.6, 4.7 
and Slutsky's theorem, it follows that 

f L (k ) = ^ ka )~ a ^a) 



aa°(k a ) 

iV( 0,1+ 9 2/ir - ) asa^oo. 
at{n-l)J 
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SUPPLEMENTARY MATERIAL 

Supplement to "Order Thresholding" (DOI: 10.1214/09-AOS782SUPP; 
.pdf). We prove Theorems 3.1, 4.1, 4.2 and 4.3 of the paper "Order Thresh- 
olding." A number of auxiliary results that are needed for these proofs are 
also stated and proved. 



38 



M. H. KIM AND M. G. AKRITAS 



REFERENCES 

Akritas, M. G. and Papadatos, N. (2004). Heteroscedastic one-way ANOVA and lack- 
of-fit tests. J. Amer. Statist. Assoc. 99 368-382. MR2062823 

Beran, R. (2004). Hybrid shrinkage estimators using penalty bases for the ordinal one- 
way layout. Ann. Statist. 32 2532-2558. MR2153994 

Bickel, P. J. (1967). Some contributions to the theory of order statistics. In Proc. Fifth 
Berkeley Symp. Math. Statist. Probab. 1 575-591. Univ. California Press, Berkeley, CA. 
MR02 16701 

Chernoff, H., Gastwirth, J. L. and Johns, M. V. (1967). Asymptotic distribution 
of linear combinations of functions of order statistics with applications to estimation. 
Ann. Math. Statist. 38 52-72. MR0203874 

David, H. A. and Nagaraja, H. N. (2003). Order Statistics. Wiley, New York. 
MR1994955 

Donoho, D. L. and Johnstone, I. M. (1994). Ideal spatial adaptation by wavelet shrink- 
age. Biometrika 81 425-455. MR1311089 

Efron, B., Tibshirani, R., Storey, J. D. and Tusher, V. (2001). Empirical Bayes 
analysis of a microarray experiment. J. Amer. Statist. Assoc. 96 1151-1160. MR1946571 

Fan, J. (1996). Test of significance based on wavelet thresholding and Neyman's trunca- 
tion. J. Amer. Statist. Assoc. 91 674-688. MR1395735 

FAN, J. and Lin, S. K. (1998). Test of significance when data are curves. J. Amer. Statist. 
Assoc. 93 1007-1021. MR1649196 

Galambos, J. (1995). Advanced Probability Theory. Dekker, New York. MR1350792 

Hall, P. (1978). Representations and limit theorems for extreme value distributions. J. 
Appl. Probab. 15 639-644. MR0494433 

Johnstone, I. M. and Silverman, B. W. (2004). Needles and straw in haystacks: 
Empirical Bayes estimates of possibly sparse sequences. Ann. Statist. 32 1594-1649. 
MR2089135 

Kim, M. H. and Akritas, M. G. (2010). Supplement to "Order thresholding." DOI: 

10.1214/09-AOS782SUPP. 
Nagaraja, H. N. (1982). Some nondegenerate limit laws for the selection differential. 

Ann. Statist. 10 1306-1310. MR0673667 
Neyman, J. (1937). Smooth test for goodness of fit. Skandmavisk Aktuanetidskrift 20 

149-199. 

Shorack, G. R. (1969). Asymptotic normality of linear combinations of functions of order 

statistics. Ann. Math. Statist. 40 2041-2050. MR0253457 
Simes, R. J. (1986). An improved Bonferroni procedure for multiple tests of significance. 

Biometrika 73 751-754. MR0897872 
Spokoiny, V. G. (1996). Adaptive hypothesis testing using wavelets. Ann. Statist. 24 

2477-2498. MR1425962 
Stigler, S. M. (1969). Linear functions of order statistics. Ann. Math. Statist. 40 770- 

788. MR0264822 

Stigler, S. M. (1973). The asymptotic distribution of the trimmed mean. Ann. Statist. 

1 472-477. MR0359134 
Storey, J. D. (2002). A direct approach to false discovery rates. J. R. Stat. Soc. Ser. B 

Stat. Methodol. 64 479-498. MR1924302 
Storey, J. D. (2003). The positive false discovery rate: A Bayesian interpretation and 

the g-value. Ann. Statist. 31 2013-2035. MR2036398 



ORDER THRESHOLDING 



39 



Department of Statistics 
Pennsylvania State University 
University Park, Pennsylvania 16802 
USA 

E-MAIL: mzkl32@psu.edu 
mga@stat.psu.edu 



